Special Issue on Advances in Data Mining and Robust Statistics
Abstract
The main aim of data mining is to extract knowledge from datasets, which are usually very large. Data mining techniques are often applied to gain initial insights about the data and to complement statistical models. This special issue focuses on the interface between data mining and statistical modelling, with special emphasis on robust statistics. Very large datasets, especially those that are machine generated and undergo limited quality control, are likely to contain outliers and anomalous measurements. The analysis of such datasets requires statistical approaches that are both computationally efficient and robust against outliers and mild departures from model assumptions. The broad scope includes, but is not limited to: visualization techniques for very large and complex data, including relational data; data analysis algorithms, including optimisation and search techniques; methodologies to draw inference on patterns and subgroups; robust models; outlier detection methods; and the analysis of dependencies.
Similar resources
Special Issue on Data Mining and Pattern Analysis in Computational Bioscience
This special issue provides a collection of papers that report recent advances in computational bioscience with a focus on biological pattern discovery and data mining. The special issue begins with a meeting report, followed by five articles. In “The DNA–Proteome: Recent advances towards establishing the protein–DNA interaction space,” Erich Grotewold and Herbert Auer summarize findings from t...
The Use of Robust Factor Analysis of Compositional Geochemical Data for the Recognition of the Target Area in Khusf 1:100000 Sheet, South Khorasan, Iran
The closed nature of geochemical data has been proven in many studies. Compositional data have special properties that mean that standard statistical methods cannot be used to analyse them. These data imply a particular geometry called Aitchison geometry in the simplex space. For analysis, the dataset must first be opened by the various transformations provided. One of the most popular of the a...
A robust least squares fuzzy regression model based on kernel function
In this paper, a new approach is presented to fit a robust fuzzy regression model based on some fuzzy quantities. In this approach, we first introduce a new distance between two fuzzy numbers using the kernel function, and then, based on the least squares method, the parameters of the fuzzy regression model are estimated. The proposed approach has a suitable performance to...
Robust production scheduling in open-pit mining under uncertainty: a box counterpart approach
The Open-Pit Production Scheduling (OPPS) problem focuses on determining a block sequence and schedule that maximize the Net Present Value (NPV) of the venture under constraints. The scheduling model is critically sensitive to the volatility of block economic value, block weight, and operational capacity. Various approaches can be recommended to deal with the OPPS uncertainties. Robust opti...
Special issue on imprecision in statistical data analysis
Different sources of imprecision, connected with either empirical data or models, may arise in statistical data analysis. Such imprecision is frequently connected with linguistic data, expert opinions, and perceptions, as well as various kinds of ill-observed data or non-precisely defined concepts. Fuzzy sets, intervals, belief functions, random sets or imprecise probability models have been used...
Journal title:
- Computational Statistics & Data Analysis
Volume 93, Issue
Pages -
Publication date 2016